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Abstract 

^ , Bulatov (2008) gave a dichotomy for the counting constraint satisfaction problem #CSP. A 

' problem from #CSP is characterised by a constraint language F, a fixed, finite set of relations 

over a finite domain D. An instance of the problem uses these relations to constrain an 
arbitrarily large finite set of variables. Bulatov showed that the problem of counting the 
satisfying assignments of instances of any problem from #CSP is either in polynomial time 
(FP) or is #P-complete. His proof draws heavily on techniques from universal algebra and 
cannot be understood without a secure grasp of that field. We give an elementary proof 
u. of Bulatov's dichotomy, based on succinct representations, which we call frames, of a class 

(/3 I of highly structured relations, which we call strongly rectangular. We show that these are 

, precisely the relations which are invariant under a Mal'tsev polymorphism. En route, we 

give a simplification of a decision algorithm for strongly rectangular constraint languages, 
■"sj" ' due to Bulatov and Dalmau (2006). We establish a new criterion for the #CSP dichotomy, 

^ \ which we call strong balance, and we prove that this property is decidable. In fact, we 

' establish membership in NP. Thus, we show that the dichotomy is effective, resolving the 

QQ \ most important open question concerning the #CSP dichotomy. 

cn 

^ ! 1 Introduction 

O . 

The constraint satisfaction problem (CSP) is ubiquitous in computer science. Problems in such 
diverse QifGclS clS Boolean logic, graph theory, database query evaluation, type inference, schedul- 
ing and artificial intelligence can be expressed naturally in the setting of assigning values from 
some domain to a collection of variables, subject to constraints on the combinations of values 
5^ \ taken by given tuples of variables [17]. CSP is directly equivalent to the problem of evaluating 

conjunctive queries on databases [22] and to the homomorphism problem for relational struc- 
tures [l7j. Weighted versions of CSP appear in statistical physics, where the total weight of 
solutions corresponds to the so-called partition function of a spin system [16] , 

For example, suppose we wish to know if a graph is 3-colourable. The question we are trying 
to answer is whether we can assign a colour (domain value) to each vertex (variable) such that, 
whenever two vertices are adjacent in the graph, they receive a different colour (constraints). 
Similarly, by asking if a 3-CNF formula is satisfiable, we are asking if we can assign a truth 
value to each variable such that every clause contains at least one true literal. 
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Since it includes both 3-COLOURABILITY and 3-SAT, this general form of the CSP, known as 
uniform CSP, is NP-complete. Therefore, attention has focused on nonuniform CSP. Here, we 
fix a domain and a finite constraint language F, a set of relations over that domain. Having fixed 
r, we only allow constraints of the form, "the values assigned to the variables vi, . . . ,Vr must 
be a tuple in the r-ary relation R G F" (we define these terms formally in Section [2]). We write 
CSP(r) to denote nonuniform CSP with constraint language F. To express 3-COLOURABILITY 
in this setting, we just take F to be the disequality relation on a set of three colours. 3-SAT is 
also expressible: to see this, observe that, for example, the clause -ix V y V -iz corresponds to 
the relation {t,f}^ \ {t,f,t}, where t indicates "true" and f "false", and that the other seven 
patterns of negations within a clause can be expressed similarly. 

Thus, there are languages F for which CSP(F) is NP-complete. Of course, we can also 
express polynomial-time problems such as 2-Colourability and 2-Sat. Feder and Vardi [17j 
conjectured that these are the only possibilities: that is, for all F, CSP(F) is in P or is NP- 
complete. To date, this conjecture remains open but it is known to hold in special cases p!1l20p26] . 
Recent efforts to resolve the conjecture have focused on techniques from universal algebra |12j . 

There can be no dichotomy for the whole of NP, since Ladner j23] has shown that either 
P = NP or there is an infinite hierarchy of complexity classes between them. Hence, assuming 
that P 7^ NP, there exist problems in NP that are neither complete for the class nor in P. 
However, it is not unreasonable to conjecture a dichotomy for CSP, since there are NP problems, 
such as graph Hamiltonicity and even connectivity, that cannot be expressed as CSP(F) for any 
finite F. This follows from the observation that any set S of structures (e.g., graphs) that is 
definable in CSP has the property that, if A E S" and there is a homomorphism B ^ A, then 
B S; neither the set of Hamiltonian nor connected graphs has this property. Further, Ladner's 
theorem is proven by a diagonalisation that does not seem to be expressible in CSP [17]. 

In this paper, we consider the counting version of CSP(F), which we denote #CSP(F). Rather 
than ask whether an instance of CSP(F) has a satisfying assignment, we ask how many satisfying 
assignments there are. The corresponding conjecture was that, for every F, ^CSP(F) is either 
computable in polynomial time or complete for #P. We give formal definitions in the next 
section but, informally, #P is the analogue of NP for counting problems. Again, a modification 
of Ladner's proof shows that there can be no dichotomy for the whole of ^P. Note that the 
decision version of any problem in N P is trivially reducible to the corresponding counting problem 
in #P: if we can count the number of solutions, we can certainly determine whether one exists. 
However, the converse cannot hold under standard assumptions about complexity theory: there 
are well-known polynomial-time algorithms that determine whether a graph admits a perfect 
matching but it is ^^P-complete to count the perfect matchings of even a bipartite graph [28]. 

Dichotomies for #CSP(F) are known in several special cases jl01linn"31ll5pi6j . each consistent 
with the conjecture that #CSP(F) is always either polynomial-time computable or #P-complete. 
However, Bulatov recently made a major breakthrough by proving a dichotomy for all F [2l[3]. 

Bulatov's proof makes heavy use of the techniques of universal algebra. A relation is said 
to be pp-definable over a constraint language F if it can be defined from the relations in F by a 
logical formula that uses only conjunction and existential quantification. Geiger [19] showed that 
an algebra can be associated with the set of pp-definable relations over F and Bulatov examines 
detailed properties of the congruence lattice of this algebra0 The structure of quotients in 
this lattice must have certain algebraic properties, which can be derived from tame congruence 
theory [21] and commutator theory [18j. Bulatov constructs an algorithm for the polynomial- 

^We will not define these terms from universal algebra, as they are not needed for our analysis. 
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time cases, based on decomposing this congruence lattice and using the structure of its quotients. 
However, he is only able to do this, in general, by transforming the relation corresponding to 
the input instance to one which is a subdirect power. It is even nontrivial to prove that this 
transformation inherits the required property of the original. His paper runs to some 43 pages 
and is very difficult to follow for anyone who is not expert in these areas. The criterion of 
Bulatov's dichotomy is based on infinite algebras constructed from T and was not shown to be 
decidable. It also seems difficult to apply it to recover the special cases mentioned above. 

Our main results are a new and elementary proof of Bulatov's theorem and a proof that 
the dichotomy is effective. Thus, we answer, in the affirmative, the major open question in [3]. 
We follow Bulatov's approach by working with the relation over F determined by the input, 
but we require almost no machinery from universal algebra. The little that is used is defined 
and explained below. We develop a different criterion for the #CSP dichotomy, strong balance, 
which is based on properties of ternary relations definable in the constraint language. We show 
that it is equivalent to Bulatov's congruence singularity criterion. 

Using strong balance, we construct a relatively simple iterative algorithm for the polynomial- 
time cases, which requires no algebraic properties. In fact, the bound on the time complexity of 
our counting algorithm is no worse than that for deciding if the input has satisfying assignments. 

We then use our criterion to prove decidability of the ^CSP dichotomy. We show that 
deciding strong balance is in NP, where the input size is that of T. Of course, complexity is not 
a central issue in the nonuniform model of #CSP, since T is considered to be a constant. It is 
only decidability that is important. However, the complexity of deciding the dichotomy seems 
an interesting computational problem in its own right. 

1.1 Our proofs 

Our proofs are almost entirely self-contained and should be accessible to readers with no knowl- 
edge of universal algebra and very little background in CSP. We use reductions from two previous 
papers on counting complexity, by Dyer and Greenhill |16j and by Bulatov and Grohe [.Sj. We 
also use results from Bulatov and Dalmau [6], but we include short proofs of these. The pa- 
pers [6l[8] deal partly with ideas from universal algebra, but we make no use of those ideas. We 
use only one idea from universal algebra, that of a Mal'tsev polymorphism. This will be defined 
and explained in Section [2] below. 

The proof is based around a succinct representation for relations preserved by a Mal'tsev 
polymorphism. We call such relations strongly rectangular for reasons which will become clear. 
Our representation is called a frame, and is similar to the compact representation of Bulatov and 
Dalmau [4j. Frames are smaller than compact representations, since they avoid some redundancy 
in the representation. 

We define a frame for a relation R C D" to be a relation F (1 R with the following two 
properties. First, whenever R contains a tuple with ith component a, F also contains such a 
tuple. Second, for 1 < i < n say that a set /S C L) is i-equivalent in i? if contains tuples which 
agree on their first i — 1 elements and whose ith. elements are exactly the members of S. Any 
set that is i-equivalent in R must also be i-equivalent in F, but note that there may be several 
common prefixes for S in R when only one is required in F. We show that every n-ary strongly 
rectangular relation over D has a small frame of cardinality at most \D\n, whereas R may have 
cardinality up to \D\'^. Further, we show how to construct such a frame efficiently and how to 
recover a strongly rectangular relation R from any of its frames. 

Now, suppose we have an instance ^ of 7^CSP(r) for some strongly rectangular constraint 
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language T, with m constraints in n variables. Using methods similar to those of Bulatov and 
Dalmau 0], we construct a frame for the solution set of <I> in polynomial time, by starting with a 
frame for L)" and introducing the constraints one at a time. A frame is empty if, and only if, it 
represents the empty relation so, at this point, we have re-proven Bulatov and Dalmau's result 
that there is a polynomial-time algorithm for the decision problem CSP(r) for any strongly 
rectangular constraint language T. We give an explicit time complexity for this algorithm, which 
is 0{mn^) for fixed T. Bulatov and Dalmau [1] gave no time estimate, showing only that their 
procedure runs in polynomial time. 

Any ternary relation i? C j4i x x ^3 (where the Ai need not be disjoint) induces a matrix 
M = (nixy) with rows and columns indexed by Ai and A2 and with 

m^y = \{z : ix,y,z) £ R}\ . 

We say that R is balanced if M's rows and columns can be permuted to give a block-diagonal 
matrix in which every block has rank one, and that a relation R C Z?" for any n > 3 is balanced 
if every expression of it as a ternary relation in Z)'^ x x D"^ {k,£,m > 1, k + £ + m = n) is 
balanced. A constraint language T is strongly balanced if every relation of arity three or more 
that is pp-definable relation over T is balanced. Via a brief detour through weighted ^^CSP, we 
show that #CSP(r) is #P-complete if F is not strongly balanced. 

If r is strongly balanced, we compute the number of satisfying assignments to a CSP(r) 
instance as follows. Let R C be the set of satisfying assignments. First, we construct a small 
frame F for R, as above. If R is unary, we have F = R so we can trivially compute \R\. 

Otherwise, for 1 < i < j < n, let Nij{a) be the number of prefixes ui . . .Ui such that there 
is a tuple ui . . . n„ G i? with Uj = a. In particular, then, summing the values of Nn-i^n{—) gives 
\R\. Since the functions Nij can be calculated easily from the frame, we just need to show how 
to compute Nij for each j > i, given Ni^ij for each j > i. Writing [k] for the set {1, . . . , k}, we 
can consider the set prjjjyjjji? to be a ternary relation on pr[j_]^]i? x pr^i? x prjR. R is strongly 
balanced so the matrix given by Mxy = |{u : (u, x,y) € prfjjuijj-R}! is a rank-one block matrix 
and the sum of the a-indexed column of the matrix is Nij{a). 

By taking quotients with respect to certain congruences, we obtain another rank-one block 
matrix M, whose block structure and row and column sums we can determine. A key fact about 
rank-one block matrices is that this information is sufficient to recover the entries of the matrix. 
This allows us to recover M and, hence, compute the values Nij{a) for each j and a. Iterating, 
we can determine the function Nn-i^n and, hence, compute \R\. 

Finally, we show that the strong balance property is decidable. Our proof of decidability 
rests on showing that, if F is not strongly balanced, then there is a counterexample with a 
number of variables that is only polynomial in the size of F. We do this by reformulating the 
strong balance criterion for a given formula as a question concerning counting assignments in 
a formula derived from ^. This reformulation enables us to apply a technique of Lovasz |2^ . 
The technique further allows us to recast strong balance in terms of the symmetries of a fixed 
structure, that is easily computable from F. We are thus able to show that deciding strong 
balance is in NP, where the input size is that of F. 

1.2 Organisation of the paper 

The remainder of the paper is organised as follows. Preliminary definitions and notation are given 
in Section [2l In Section [3l we define the notion of strong rectangularity that we use throughout 
the paper and, in Section HI we further study the properties of strongly rectangular relations and 
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introduce frames, our succinct representations of such relations. We give an efficient procedure 
for constructing frames in Section [3 In Section [6l we introduce counting problems and, in 
Section [71 we define the key notion of a strongly balanced constraint language and prove that 
7^CSP(r) is solvable in polynomial time if F is strongly balanced and is #P-complete otherwise. 
In Section [HI we show that our dichotomy is decidable, in fact in the complexity class NP. Some 
concluding remarks appear in Section [H 

2 Definitions and notation 

In this section, we present the definitions and notation used throughout the paper. We defer to 
Section [8] material relating to certain classes of functions that are used only in that section. 
For any natural number n, we write [n] for the set {1, . . . , n}. 

2.1 Relations and constraints 

Let D = {di, d2, ■ ■ ■ ,dq} be a finite domain with q = \D\. We will always consider ^ to be a 
constant and we assume that g > 2 to avoid trivialities. A constraint language F is a finite set 
of finitary relations on D, including the binary equality relation {{di,di) : i € [q]}, which we 
denote by =. We will call & = {D,T) a relational structure. We may view an r-ary relation H 
on D with i = \H\ as an £ x r matrix with elements in D. Then a tuple t £ H is any row of this 
matrix. We will usually write tuples in the standard notation, for example {ti,t2, ■ ■ ■ ,tr). For 
brevity, however, we also write tuples in string notation, for example, tit2 ■ ■ - tr, where this can 
cause no confusion. 

If R is an n-ary relation and i = (ii, . . . , i^,) are distinct elements of [n], we write pr;i? for the 
projection of R on i, the relation containing all tuples (a^^, . . . ,aij.) such that (ai, . . . ,0^) G R 
for some values of the aj where j ^ i. For / C [n], we write pr jR as shorthand for pr;i?, where i 
is the enumeration of Fs elements in increasing order. For the relation {t}, where t is a single 
n-tuple, we write prjt rather than pr;{t}. 

We define the size of a relation H as \\H\\ = ir, the number of elements in its matrix, and 
the size of F as ||F|| = Yliii^r 11-^11- '^^ avoid trivialities, we will assume that every relation 
iif £ F is nonempty, i.e. that ||-ff|| > 0. We will also assume that every d € D appears in a tuple 
of some relation E F. If this is not so for some d, we can remove it from D. It then follows 
that ||F|| > q. 

Let V = {i'i,i'2, ■ ■ ■ ^T^n} be a finite set of variables. An assignment is a function x: 1/ — )• 
D. We will abbreviate x(fj) to Xj. If {ii, Z2, . . . , ir} C [n], we write H{xi^,Xi^, . . . ,Xi^) for 
the relation © = {x : (xi^,Xi2, . . . G H} and we refer to this as a constraint. Then 

(fjj, j/jj, . . . , i^v) is the scope of the constraint and we say that x is a satisfying assignment for 
the constraint if x £ 0. 

A T-formula <I> in a set of variables {xi,X2, . . . , Xn} is a conjunction of constraints ©i A • • • A 
©m. We will identify the variables with the Xi above, although strictly they are only a model 
of the formula. Note that the precise labelling of the variables in <I> has no real significance. 
A formula remains the same if its variables are relabelled under a bijection to any other set of 
variable names. 

A F-formula <1> describes an instance of the constraint satisfaction problem (CSP) with con- 
straint language F. A satisfying assignment for <I> is an assignment that satisfies all ©j (i G [m]). 
The set of all satisfying assignments for <I> is the T-definable relation i?$ over D. We make no 
distinction between <I> and unless this could cause confusion. 
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2.2 Definability 



A primitive positive (pp) formula ^ is a F-formula $ with existential quantification over some 
subset of the variables. A satisfying assignment for ^ is any satisfying assignment for $. The 
unquantified (free) variables then determine the pp-definable relation Ri^/, a projection of 
Note that any permutation of the columns of a pp-definable relation is, itself, pp-definable. 
Again, we make no distinction between ^ and R^. 

The set of all F-definable relations is denoted by CSP(r) and the set of all pp-definable 
relations is the relational clone (F). If F = {H, =}, we just write (H). An equivalence relation 
in (F) is called a congruence. 

2.3 Polymorphisms 

A k-ary polymorphism of F is any function ip: — )• D, for some k, that preserves all the 
relations in F. By this we mean that, for every r-ary relation H £ F and every sequence 
ui, . . . , Ufc of r-tuples in H, 

V'(ui,U2, . . . ,Ufc) = (-0(^1,1, . . . ,Mfc,l), -0(^^1,2, • • ■,Uk,2), ■0(wi,r, • • ■,Uk,r)) G H . 

It is well known that any polymorphism of F preserves all relations in (F) (see Lemma (H). 

A Mal'tsev polymorphism of F is a polymorphism tp: — )■ D such that, for all a,b £ D, 
ip{a, b, b) = ip{b, b, a) = a. (So, in particular, (/?(a, a, a) = a.) We will usually present calculations 
using ip in a four-row table. The first three rows give the triple of "input" tuples ti,t2,t3 
and the fourth gives the "output" 93(ti, t2, ts). For example, the table below indicates that 
</?(au, ov, 6w) = (6, 99(u, V, w)). 

a u 
a V 

b w 

b V9(u, V, w) . 

2.4 Complexity 

For any alphabet S, we denote by FP the class of functions /: S* — )• N for which there is a 
deterministic, polynomial-time Turing machine that, given input x G S* writes f{x) (in binary) 
to its output tape. #P is the class of functions f : Ti* —?■ N for which some nondeterministic, 
polynomial-time Turing machine has exactly f{x) accepting computations for every input x € S*. 

Completeness for #P is defined with respect to polynomial-time Turing reductions [29J, also 
known as Cook reductions. For functions f,g: S* — )• N, a polynomial-time Turing reduction from 
f to g is a polynomial-time oracle Turing machine that can compute / using an oracle for g. A 
function / £ #P is ^P- complete if there is a Cook reduction to / from every problem in ^^P. 

The class ^^P plays a role in the complexity of counting problems analogous to that played 
by NP in decision problems. Note, however, that, subject to standard complexity-theoretic 
assumptions, #P-complete problems are much harder than NP-complete problems. Toda has 
shown that P"^^ includes the whole of the polynomial-time hierarchy [27], whereas P'^'^ is just 
the hierarchy's second level. 
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3 Rectangular relations 



A binary relation B Ai x A2 is called rectangular if (a, c), {a,d), {b,c) E B implies (6, d) G B 
for all a,b £ Ai, c,d G A2. We may view B as an undirected bipartite graph Qb, with vertex 
bipartition Ai, A2 and edge set Eb = {{01,02} : (01,02) € -B}. Note that we do not insist 
that Ai n A2 = ^ but, if a S ^1 n ^2, o is regarded as labelling two distinct vertices, one in Ai 
and one in A2. Formally, Ai and A2 should be replaced by the disjoint vertex sets {1} x Ai 
and {2} x A2 but this would unduly complicate the notation. We will assume that pr^B = Ai 
{i = 1,2), so that Qb has no isolated vertices. The connected components of Qb will be called 
the blocks of B. 

Rectangular relations have very simple structure. 

Lemma 1. If B is rectangular, Qb comprises k bipartite cliques, for some k < min{|Ai|, |^2|}- 

Proof. Let k be the number of connected components of Qb- Every vertex is included in an edge 
so A; < min{|yli|, 1^21}- Consider any component C and suppose it is not a bipartite clique. Let 
o G AiCiC, z € A2^C be such that {a, z\ ^ Eb- Thus, a shortest path in C from a to 2: has length 
at least 3. If o, 6, c, d are the first four vertices on such a path, then {a, 6}, {&, c}, {c, d} € Eb-, 
but {a, d} ^ Eb as, otherwise, there would be a shorter path from a to 2. But this is equivalent 
to (o, 6), (c, 6), (c, d) € B and (a, d) ^ i?, contradicting rectangularity. □ 

Where appropriate, we do not distinguish between B and Qb- For example, we will refer to 
a connected component of Qb as a block. 

Corollary 2. The relations 

0i{xi,X2) = 3y {B{xi,y) A B{x2,y)) and 02{yi,y2) = {B{x,yi) A B{x,y2)) 

are equivalence relations on priB, pr2B respectively. The equivalence classes of 9i and 62 are in 
one-to-one correspondence. 

Proof. The blocks of B induce partitions of Ai and A2 which are in one-to-one correspondence. 
These clearly define the equivalence classes of 61 and 62. □ 

Corollary 3. If T is a constraint language and B € (F) is rectangular, then the relations 61 
and O2 of Corollary \M ere congruences in (F) . 

Proof. Since B has a pp-definition, so too do 9i and 62- □ 

We say that a relation R C for n > 2 is rectangular if every expression of i? as a binary 
relation in x (1 < < n) is rectangular. We call a constraint language F strongly 

rectangular if every relation B € (F) of arity at least 2 is rectangular. If C D"' is a relation, 
we say that it is strongly rectangular if {R) is strongly rectangular. If i? S (F) for a strongly 
rectangular F, then R is strongly rectangular, since {R) C (F). 

From the definition, it is not clear whether the strong rectangularity of F is even decidable, 
since (F) is an infinite set. However, it is decidable, as we will now show. The following result is 
usually proven in an algebraic setting. That proof is not difficult, but requires an understanding 
of concepts from universal algebra, such as free algebras and varieties [12j. Therefore, we will 
give a proof in the relational setting. Moreover, we believe that this proof will provide rather 
more insight for the reader whose primary interest is in relations. 

First, we require the following lemma, which is well-known from the folklore; we provide a 
proof for completeness. 
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Lemma 4. (p is a polymorphism of T if, and only if, it is a polymorphism of (F) . 

Proof. Let be a polymorphism of T and let R E (P). We prove that (p is a polymorphism of 
R by induction on the structure of the defining formula of R. The base case, atomic formulae 
{H{x) for relations H gT) is trivial. 

Suppose R is defined by 3y ^/;(x, y). If ai, a2, as G R, then there are bi, 62, 63 such that aj6j G 
ip {i G {1,2,3}). If is a polymorphism of ip, then it follows that cd = 99(ai6i, a262, as^s) E ■0, 
which means that c G i?, as required. 

Finally, suppose R is defined by V'(x) A If ai, a2, as G i?, then a^ G V' H x for each i. If 

(p is a, polymorphism of ifj and of x then c = (/?(ai, a2, as) G ^ x and, therefore, c G i2. 

Conversely, F C (F) so every polymorphism of (F) is a polymorphism of F. □ 

Lemma 5. A constraint langauge F is strongly rectangular if, and only if, it has a Mal'tsev 
polymorphism. 

Proof. Suppose F has a Mal'tsev polymorphism p. Consider any pp-definable binary relation 
B C X D^. By Lemma [H p is also a polymorphism of B. If (a, c), (a, d), (b, d) G B then 
we have ((/^(a, a, b), (/?(c, d, d)) = (b,c) G B, from the definition of a Mal'tsev polymorphism. 
Thus, B is rectangular and hence F is strongly rectangular. 

Conversely, suppose F is strongly rectangular. Denote the relation H £T by H = {u^ : i G 
[■^j?]}) where G D^^. Consider the F-formula 

^(^) = A A A A - 

where x^^^^^ is an r/^-tuple of variables, distinct for all G F, ^1,^2,^3 G [in]- Thus, the 

relation i?$ has arity r$ = ^h&v'^h^^ and = Hz/Gr 

Clearly R<^ has three tuples ui, U2, U3 such that the sub-tuple of Uj corresponding to x^^^^ 
is uf. for each j G {1,2,3} and each ii, 22,^3 G [in]- Then U = {ui,U2,U3} has the following 
universality property for F. For all -ff G F and every triple of (not necessarily distinct) tuples 
ti, t2, t3 G H, there is a set I = /(ti,t2,t3) with / C [r$], \I\ = rn such that prj-i?$ = H and 
prjUj = ti {i = 1,2,3). 

Now, for each set of identical columns in U, we impose equality on the corresponding variables 
in <!•, to give a F-formula <^'. Let U' be the resulting submatrix of U, with rows u'^, Ug, Ug. 
Observe that U' is obtained by deleting copies of columns in U. Therefore U' has no identical 
columns and has a column (a, 6, c) for all a,b,c (z pr^-fT with G F and k G [rn]- 

Next, for all columns (a, 6, c) of C/' such that b ^ {a, c}, we impose existential quantification 
on the corresponding variables in <!>', to give a pp-formula Let U" be the submatrix of U' 
with rows u", U2, Ug corresponding to u'^, Uj, Ug. Then [7" results from deleting columns in U' 
and U" has columns of the form (a, a, 6) or (c, d, d). Thus, after rearranging columns (relabelling 
variables), we will have 





fun 




" a 


c " 


u" = 






a 


d 








_b 


d_ 



for some nonempty tuples a, b, c, d so, by strong rect angularity, u" = [b c] G R^". 

Removing the existential quantification in <I>", u" can be extended to u' G R^'. Now, if 
column k of U' is {a,b,c) say, we define ip{a,b,c) = u^. This is unambiguous, since U' has no 
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identical columns. Thus, u' = (^(u'^^, Ug, Ug) G R^'. If, for any a,b,c G D, {p{a,b,c) remains 
undefined, we will set ip{a, b,c) = a unless a = 6, in which case ip{a, b, c) = c. Clearly satisfies 
ip{a, 6, b) = ip{b, b, a) = a, for all a,b £ D, and so has the Mal'tsev property. 

Removing the equalities between variables in u' can be further extended to give the tuple 
u = 99(ui,U2,U3) G This is consistent since u satisfies the equalities imposed on $ to 

give Now, for any ti,t2,t3 G H, the universality property of U implies that, for some /, 
prju = 99(ti,t2,t3) G H. Thus, 93 preserves all H £ T, so it is a polymorphism and hence a 
Mal'tsev polymorphism. □ 

Remark 1. Observe that the proof of Lemma [5] uses all the elements of pp-definability. Thus, if 
LemmaOis to hold true, the definition of strong rectangularity cannot be significantly weakened. 

Remark 2. The proof of Lemma [5] is constructive and, hence, implies an algorithm for deciding 
whether T is strongly rectangular and, if so, determining a Mal'tsev polymorphism 99. However, 
we describe a more efficient method in Lemma [5] below. 

Note that strong rectangularity is invariant under permutations of the columns of a relation, 
both by Lemma [5] (since permutations of columns do not affect Mal'tsev polymorphisms) and 
by the fact that permutations are pp-definable. We will use this fact repeatedly and consider a 
relation R C for some n > 2 to be a binary relation on x D^-^ or a ternary relation on 
j-)k X X D"-f'-'''^ for any appropriate values of k and £. 

In the algebraic setting, the result corresponding to Lemma [5] is that (L) has a Mal'tsev 
polymorphism if, and only if, T is congruence permutable. See, for example, \12l- This has 
the following meaning. If pi and p2 are congruences on a pp-definable set A C D^, define the 
relational product "0 = Pi ° P2 by 'ip{^, y ) = 3z (x(z) A pi (x, z) A /O2 (z, y)) , where x is the formula 
defining A. Then pi, p2 are permutable if V'(u, v) implies ip{'v, u) for all u, v G A or, equivalently. 
Pi ° P2 = P2 ° Pi- Now r is congruence permutable if every pair of congruences on the same set 
A is permutable. For completeness, we will prove the following. 

Lemma 6. T is strongly rectangular if, and only if, it is congruence permutable. 

Proof. Suppose F is strongly rectangular. If pi, p2 are congruences on a pp-definable set A C D^, 
let ■0 be the relational product, as defined above. Clearly ip is a pp-definable binary relation on 
D^. Then, if (u, v) G ip, we have (u, u), (u, v), (v, v) G ip, since pi and p2 are congruences. But 
this implies (v, u) G since ip is rectangular. Thus, F is congruence permutable. 

Conversely, if F is congruence permutable, consider a pp-definable relation B C x D^. 
Define a relation ~i on B by (xi,yi) ~i (x2,y2) if, and only if, (xi,yi) G B, (x2,y2) G B 
and xi = X2. This is pp-definable, by i?(xi,yi) A 5(x2,y2) A (xi = X2), and is clearly an 
equivalence relation. Hence it is a congruence. Similarly, define a congruence ~2 on D'^'^^ by 
(xi,yi) ~2 (x2,y2) if, and only if, (xi,yi), (x2,y2) G B and yi = y2. Let -0 =~i o 

Suppose ((a, c), (b, d)) G ip. Then there exists (u,v) G B such that (a, c) ~i (u, v) ~2 
(b, d). Thus, (u, v) = (a, d) and, hence, (a, c), (a, d), (b, d) G B. Congruence permutability 
implies ((b, d), (a, c)) G tp. Hence there exists (u', v') G B such that (b,d) ~i (u', v') ~2 (a, c). 
Thus, (u', v') = (b, c). Therefore we have (b,c) G B and F is strongly rectangular. □ 

Corollary 7. F is congruence permutable if, and only if, it has a Mal'tsev polymorphism. 

Proof. This follows directly from Lemmas [5] and [H □ 

We will now consider the complexity of deciding whether F is strongly rectangular. 
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Lemma 8. We can decide whether T is strongly rectangular in C'(|[r|['^) time and, if so, deter- 
mine a Mal'tsev polymorphism cp. 

Proof. Observe that there are at most possible Mal'tsev operations D. This 

follows since there are q{q — 1)^ triples a,b,c & D which have b ^ {a,c}. For all other triples, 
the value of if{a, b, c) is determined by the condition that if is Mal'tsev. Thus, there are 0{1) 
possibilities for ip. For an r-ary relation H £T with £ tuples, we can check in 0{l^r) = 
time whether H is preserved by any of them. If so, we have ip ; if not, F is not strongly 
rectangular. □ 

Remark 3. We have assumed that g is a constant in Lemma [H We revisit this question in 
Section [8l where we make no such assumption. 

In view of Lemma [HI we may assume that we have determined a Mal'tsev polymorphism ip 
for any given strongly rectangular F. 

Strongly rectangular constraint languages have another useful property. For each a S D, 
define the constant relation Xa = {(o)}- Then the constraint Xaixi) fixes the value of Xi to be a. 

Lemma 9. IfT is strongly rectangular, then so is T' = T U {xa}- 

Proof. By Lemma O F is preserved by a Mal'tsev polymorphism ip. Since (p{a, a, a) = a for any 
a & D, ip also preserves Xa- Thus ip preserves F', so F' is strongly rectangular, by Lemma [5l □ 

In the light of Lemma [9l we may assume that {xa '■ o- € D} C F whenever F is strongly 
rectangular. 

Remark 4. More generally, the property of a polymorphism ip that we have used in Lemma [9l 
that ip{x for any x G D, is called idempotence in the algebraic literature on CSP. 

4 The structure of strongly rectangular relations 

Let R C be a strongly rectangular relation. For any i € [n], we say that an n-tuple t £ R 
is a witness for a £ pf^R if ti = a. We will abbreviate this by saying that t witnesses (a, i). 
If t = (u, a,v) S R, we call u a prefix for a. Now define a relation ~j on pr^i? by a ~j 6 if, 
and only if, there exists u E which is a common prefix for a and b. That is, there exist 

VaiVfe G Z)""* such that (u, a, v^), (u, 6, v;,) € R. 

Lemma 10. ~j is an equivalence relation on pr^R and a congruence in (R). 

Proof. Consider the binary relation B on prjj^^ji? x pr^R defined by B{u,a) = 3yi?(u, a,y). 
Then ~j is the equivalence relation 02 of Corollary [2l which is a congruence by Corollary [3j □ 

Let £i^k {k G [Ki]) be the equivalence classes of ~i for € [q], i € [n]. Observe that ki = 1, 
since all a € pr^i? have witnesses with the common empty prefix. More generally, we make the 
following observation, which follows directly from the block structure of the relation B in the 
proof of Lemma [TOl 

Corollary 11. There is a common prefix Uj j, € D^~^ for all a € £i^k (k S [Ki],i € [n]) and we 
can choose Uj^^ to be any prefix of any a € 8i^k- 
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Following Bulatov and Dalmau [4J, if H is any relation and ip a Mal'tsev operation (i.e., a 
ternary function that is not necessarily a polymorphism but has the property that (p{a, b, b) = 
(p{b,b,a) = a for all a,b £ D), then c\^H is the smallest relation that contains H and is closed 
under if. Clearly c\^H is a strongly rectangular relation with polymorphism (p and we say that 
the H generates c\^H. The following observation, from [4J, gives a simple but important fact. 

Lemma 12. Let H be an n-ary relation. If I [n], then d^pr^H = prjd^H. 

Proof. Consider generating d^pr jH while retaining all n columns of H. Each row of the resulting 
n-ary relation will be in d^pH, so we have d^pprjH C pr^-cl^if. But further operations to generate 
d^pH cannot add new rows to dipprjH. So, in fact, we have cl^pr^-ff = pr^cl^//. □ 

Let S = {ti, t2, . . . , t<j} be a set of n-tuples, presented as an s x n matrix. If / C [n], we will 
need to compute a relation T C d^pS such that pr^T = cl^pr^S = pTld^pS. 

Lemma 13. If H- = \prJd^pS\ and s = \S\, then a relation T C d^S such that pr jT = prjd^S 
can be computed in time 0{ni^ + 

Proof. Consider the algorithm CLOSURE, on the following page, 
procedure Closure(/) 

1: £ ^ S, ji ^ 2 

2: while ji < ^ do 
3: for j2 G [ji] do 
4: for js e [j2] do 

5: for all permutations {ki,k2,k^) of {ji.,32-,jz} such that k2 ^ {^1,^3} do 

6: U 93(tfc^,tfc2,tfc3) 

7: if there is no j E [I] such that pr/tj = pr^u then 

8: £^£+1, tf^U 

9: jl ^ il + 1 

The correctness of Closure is trivial. At termination, all triples {ki,k2,k-i) G [Pf' have 
been considered for generating new n-tuples (in line [6]), so we have computed cl^pr^^S'. The 
analysis is equally easy. There are l'^ triples {ki, ^2,^3). For each triple, the generation in line [6] 
takes 0{n) time and the search in line [7] requires 0{si) time, with the obvious implementations. 
Thus, the total is 0{n£^ + s£'^). □ 

The procedure outlined in has complexity 0{ni^ + s£^), since the same triple (A:i, ^2, ^3) 
can appear Q.{t) times. The procedure Closure simply avoids this. 

The time complexity of Closure could be improved, for example, by using a more sophisti- 
cated data structure to implement the searches in line [71 However we do not pursue such issues 
here, or elsewhere in the paper. 

Now we define a frame for an n-ary relation to be a set F C. R such that 

(a) pr^F = pr^R for each i S [n]; and 

(b) there is a Vj^^ S D*^^ for each equivalence class Si^k of ~i {k G [/^^],'^ G [n]) such that, for 
each a G Si^k, there exists a Wq G F with pr[j]Wa = Vj^fcO. 

Clearly, R itself satisfies the definition of a frame, so every relation has at least one frame. 
However, we will show that strongly rectangular relations have frames that can be much smaller 
than R and we call a frame for a strongly rectangular relation R C small if \F\ < n{q — l)-\-l. 
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A witness function for a frame F of the relation i? is a function uj: D x [n] F such 
that u{a,i) witnesses (a,i) for all a G pr^i? and i G [n] and pr[j_;^]a;(a, i) = pr\^^_^u{h,i) when 
a h. That is, u{a,i) returns a witness for (a,z) and, if (ai,i), . . . , {ak,i) have witnesses with 
a common prefix, then u returns such witnesses. 

Lemma 14. Let F be a frame for a strongly rectangular relation R C D^. We can determine a 
small frame F' for R and a surjective witness function uj' : D x [n] ^ F' in time OdlFjp). 

Proof. In time we can compute the relations ~j (i € [n]) and common prefixes for 

each ~j-equivalence class. Hence, we can compute a witness function u for F. Further, we may 
delete from F any tuple t for which a^~^(t) = 0. Because is a witness function, the resulting 
set is still a frame for R and has size at most Yli^[n] |P''i^l — ''^Q- 

Now we construct F' and uj' as follows. Choose any f G F and set F' = {f}. Then, for each 
i € [n], do the following. Let g = u{fi,i) and set Lj'{fi,i) f. Now, consider in turn each 
a ^ fi such that a ~j fi and let h = u{a,i). Note that g and h have the same prefix u' £ D^~^, 
since is a frame, and suppose f has prefix u € Then set h' ^ ip{{, g, h), F' F' U {h'} 

and uj'{a,i) ^ h'. Since 



t 


u 


n 


V 


g : 


u' 


fi 


v' 


h : 


u' 


a 




h' : 


u 


a 


'/'(v, v', Va) , 



this ensures that F' retains property |(b)| of a frame. Having performed these steps for each i £ [n], 
we deal with those a e pr^F with a fi by setting F" F' U {a^(a, i)} and a;'(a, i) ^ ^(a, "i). 

The final size of F' can be bounded as follows. The tuple f witnesses {fi,i) for all i G [n]. 
Then, for each i G [n], there is at most one tuple in F' witnessing (a, i) for each a G prj-R\ {fi}- 
Since there are, in total, (|P''i-^l — l) < "■('7 — 1) such pairs (a,i), it follows that F' is a 

small frame. 

The time bound is easy. Given the function u, we can determine the h' in 0{n) for each 
i G [n]. All other operations require 0{1) time for each i G [n]. Thus, we can need only 
O(n^) = 0(||F|p) time once we have determined u, which can also be done in 0(||F|p) time. □ 

Remark 5. The upper bound for the size of a small frame is achieved by the complete relation D": 
We exhibit a small frame for in Lemma [T2] below. However, a frame can be much smaller than 
this upper bound n(g— 1) + 1. Consider, for example, the n-ary relation R = {(a, . . . , a) : a G D}. 
It is easy to show that R is strongly rectangular. However, it is also easy to see that F = R is 
a frame, with u{a, i) = (a, . . . , a) (i G [n]) and |F| = q. 

Remark 6. The compact representations of Bulatov and Dalmau [4J are not necessarily frames 
and can have size nq'^/2. However, it appears that a frame could be constructed efficiently from 
such a representation using methods similar to those of Lemma [TH 

We will suppose below that all frames are small. If necessary, this can be achieved using 
Lemma [m Note that we do not assume that a frame for R can actually generate R, since this 
is entailed by the following. 

Lemma 15. If R is strongly rectangular with Mal'tsev polymorphism (p and F is a frame for 
R, then cLF = R. 



12 



Proof. F C R so cl^pF C cl^i? = R. It remains to show that R C d^pF. 

We show by induction on i G [n] that pr^i^R C prjjjcl^F. The base case, z = 1, is trivial 
as pr;^it! = pfiF by definition. Suppose that prjj_yi? C pr[j_ycl^F and let t = {ti,...,tn) = 
(u,ij,v) G i2. By the inductive hypothesis, we have u € pr[j_;^]Cl^F so there is a tuple t' = 
(u, t^,v') G cl<^-F C R. Therefore, t'^ ~j tj, which means there are tuples (u', tj,w) and {u',t'-,-w') 
in F witnessing {ti,i) and {t[,i), respectively. Thus, we have 

u t'i v' 

u' t[ w' 

u' ti w 

U tj (^(v', w', w) . 

Therefore, (ti, . . . ,ti) G prjjjCl^F, continuing the induction. □ 

Given ip and the matrix for F, the procedure of Lemma [15] can be used to decide t E i? 
in time 0{n^). There is no need to generate the whole of R; we just keep track of the tuple 
(u, ti, ip{v', w', w)) that witnesses that (ti, . . . , tj) G prj^jcl^-F. If the procedure succeeds, we have 
demonstrated that t G d^pF = R; otherwise, we conclude either that t ^ R oi that R is not 
strongly rectangular. 

We now show how, given a frame for R, we can determine a frame for the relation 

R{ai, . . . ,ai,Xi+i, . . . ,Xn) = {t e R : (ti, . . . ,ti) = (ai, . . . ,0^)} . 

Lemma 16. Given a small frame F for R{xi,X2, • • • , Xn), a frame for R{a, X2, • • • , x„) can be 
constructed in O(n^) time. 

Proof. We abbreviate R{a, X2, • • • , x„) to i?(a, •). For each i = 2, . . . ,n, determine cl^pr^ ^F = 
priid^F = pr^R. Note that |pr;^ < and ||F|| = 0{n) so this requires 0{n) time for each 
i, and 0{n'^) time in total. We have (a, 6) G pr^R if, and only if, b G pr^R^a,-). Also, we have 
calculated a witness (with respect to R) for each b G prji2(a, •). Let be the usual congruence 
for R and ~^ the corresponding congruence for i?(a, •). Clearly b c implies b ~j c, since there 
are witnesses (a, u, 5, v), (a,u, c, v') G R. On the other hand, if 5 c and b G prji?(a, •), then 
c G prji?(o, •) and 6 ~^ c, since we have 

a u b V 

a' u' 6 v' 
a' u' c v" 



u 



c v?(v,v',v") 



Thus, the equivalence classes of ~^ are a subset of those of ~j. Therefore we can construct ~^ 
and a witness for each b G prji?(a, •), using F and the n-tuples from the calculation of pr^ ji?. □ 

The following corollary is immediate, by iterating the Lemma [16] i < n times. 

Corollary 17. Given a frame F for the relation R{xi,X2, . . . ,Xn), we can construct a frame 
for R{ai, . . . ,ai,Xi+i, . . . ,Xn) in O(n^) time. 
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5 Constructing a frame 



If R is F-definable, then t G R can be decided in polynomial time by checking that t satisfies 
each of the defining constraints. We cannot use this method to decide -R = efficiently but this 
can be done trivially using any frame F for R, since i? = if, and only if, F = 0. If F 7^ 0, then 
any f G F is a certificate that i? 7^ 0. Similarly, given a frame for R and any tuple (oi, . . . , a,), 
we can determine whether there is any t £ R such that (ti, . . . ,ti) = (oi, . . . , aj), using the 
method of Corollary [T71 

However, we must be able to construct some frame F for R efficiently. If T is strongly 
rectangular, we will show how to determine a frame for a F-formula $ having m constraints 
in n variables, in time polynomial in m, n and ||r||. This is achieved, as in [4J, by adding the 
constraints sequentially. 

If the m constraints are 9i, 62, . . . , ©m, let <I>s = Gi A 02 A • • • A B^. Thus, <i>o = -D", the 
complete n-ary relation on D, and = ^- We begin by constructing a frame for <I>o- 

Lemma 18. A small frame Fq for <I>o can be constructed in 0{n) time. 

Proof. Let d be any element of D and let Fq = {f^} U {t"'* : i € [n],a G D \ d}, where 



t^ = d and = <! " ^f-? ^ g [n]). 

■' ' otherwise 



Clearly all these tuples are in $o- Also u{d,i) = and uj{a,i) = f^'* (a 7^ d), for all i € [n], 
is a witness function. Further, we have pr[j_;^]t"'* = prjj^j^jf^ = {d, . . . ,d). Thus, Fq satisfies the 
conditions for being a frame. We have \Fq\ = n{q — 1) + 1, so Fq is small. □ 

Note that \Fq\ matches the upper bound for the size of a small frame. 

Now, we show how to determine a frame for $s given a frame for $s_i. We first show that 
this can be done in polynomial time when ||r|| = 0{1). This is nonuniform CSP, the most 
important case. 

Lemma 19. Given a frame F for $ and a constraint Q, a frame F' for <!>' = <I> A can be 
constructed in 0{n'^) time. 

Proof. Suppose that = H{xij^,Xi^, . . . ,Xi^), where H G T has arity r. We will assume that 
Xij^,Xi2, . . . , Xi^ are distinct since, otherwise, we can consider a smaller relation H' over the 
distinct variables. Let / = {ii, Z2; • • • ) v}- For each i € [n], let Jj = / U {i} and determine 
Tj C $ such that prjTj = cl<pprj.<I> using Closure, li ^ = |pr/$|, then |ri| < qi, so this takes 
time 0{n(.^ + r£^) by Lemma[13l' But, since ||r|| = 0(1), we have r = 0(1), i < = 0(1) and 
0{nl'^ + r^^) = 0(n). The entire computation for all i therefore takes time 0{n'^) and we have 
T.^\n = 0{n). 

Determine C/j, the set of tuples in Tj that are consistent with 0, so C Now [/, contains 
a witness for each a £ prj$', since 

^rjVi = (pr^T,) n = (cl^pr^^F) n = (pr^,$) n = pr^^(<I. A 0) = pr^^L' . 

Thus, in particular, pr^f/j = prj<I>'. We now do the following for each i E [n]. 

Let A prjC/j and repeat the following until A = 0. Choose t £ Ui such that tj G 
A. Determine a frame F* for <I>(ti, . . . , Xj, . . . , x^) in 0(n'^) time, using Corollarv 1171 
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Clearly t G d^F*, so F* ^ 0. Now determine the intersection of Q with the relation R* = 
^{ti, . . . , ti-i,Xi, . . . , Xn) generated by F* using Closure, as was done for <I> above. This takes 
0{n) time; let the resulting relation be R°. Now, by Corollary II 11 pr^R° is the equivalence class 
£" = {a : a ~^ ti} of ti in For each a G £, we can find a witness a;'(a, i) G R° for a € prj<I>' 
and these have the common prefix [ti, . . . , U-i). We set ^ ^ \ and repeat. 

At the end of this process, u' is the witness function for a frame F' for <!>'. The total time 
required is 0{n^\F'\) = 0{n^). □ 

Lemma 20. A frame F for <1> can be constructed in time 0{mn'^). 

Proof. Construct <l>o in 0{n) time. Then, apply Lemma [T9l to construct a frame Fi for <l>i from 
a frame Fi^i for <l*j-i, for each i € [m]. At termination, set <^ ^ <I>m and F ^ Fm- Q 

Since a relation has for a frame if, and only if, it is empty (and has no other frame), we 
can determine in time O(mn^) whether there is a satisfying assignment to a CSP instance in a 
fixed strongly rectangular vocabulary. By Lemma [5l we have re-proven the main result of [1] . 

We assumed above that ||r|| = 0{1). However, we can still perform the computations of 
Lemma [19] in time polynomial in m, n and |[r||. 

Lemma 21. A frame for <I> can be constructed in time 0{mn^ + mn^ ||r||^). 

Proof. We indicate how the proof of Lemma [19] must be modified. It is only the computation 
of the Ui that requires improvement, which we achieve by using a device from Suppose we 
wish to add a constraint = H{xi^,Xi^, . . . , xi^) to <I>. Instead, we add in turn the r constraints 
Gfc = Hk{xij^,Xi^, . . . ,Xii^), where Hk = pr^^^H for each k G [r]. Thus, \Hi\ < q and Hr = H. 
Letting = <I>, we successively calculate frames for = "^k-i A {k £ [r]), so = 
If Ik = {ii,i2, ■ ■ ■ ,ik} {k £ [r]), we have 

4 = |pr/^^'fe-i| < g|pr^^_^^'fc_i| < q\Hk-i\ < q\H\ . 

Thus, for each G [r], the time required to compute Ui and R° in Lemma[T9lbecomes 0{n'^\H\^ + 
nr\H\^). In total, the time requirement is 0(n^r|i?p+nr^|i?|^) = C'(n^||iJ||'^) = ©(n^HrH^). □ 

6 Counting problems 

We consider the problem of determining l-R^I, which we abbreviate to \^\, where $ is a L-formula 
with m constraints and n variables. We require the computations to be done in time polynomial 
in the size of the input ^ and we assume ||r|j = 0{1). In fact, the size of ^ can be measured by a 
polynomial in n. A repeat of a constraint can be removed, since this does not change Then 
an r-ary relation in T can give rise to 0{n^) constraints. We will assume that every variable 
appears in at least one constraint. Otherwise, suppose no variables do not appear: letting <!>' be 
$ with these variables deleted, we have \^\ = q"''^\^'\. Hence we will assume that m = 

Following Bulatov and Dalmau |6], we call this computational problem ^CSP{T). If F = 
{H,=}, we write i^CSP{H). We will use the following result from [6j, which we prove here for 
completeness. The corollary is immediate. 

Theorem 22 (Bulatov and Dalmau [6j). Let (3 = {D,T), & = {D,T') be relational structures 
with F' C (F). Then #CSP(F') is polynomial-time reducible to #CSP(F). 
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Proof. Let each H' £ T' have pp-definition H'{x) = 3y H* {x,y), with i7*(x, y) a F-formula. 
If ah relations in F have arity at most r and at most i tuples and all the formulae H* are 
conjunctions of at most k constraints, then each H* has arity at most kr and \H*\ < Observe 
that k, I and r are constants in ^CSP(r'). 

Consider any V formula <I>(x) = ©i A • • • A ©m, where x = (xi, . . . , Now, if ©j = H'{x), 
let 0* = H*{x,yi), where the {i G [m]) are new variables. Let z — (yi; • • • ^Ym) and consider 
the F-formula <I>*(x, z) = ©J A • • • A 0J^. This is an instance of 7^CSP(F), with at most km 
constraints and n + krm variables. Now, for x E let 

Ni{x) = |{y, : (x,yO G ©*}| < \H*\ <t (i G [m]), 

and let = max{7Vj(x) : i G [m], x G < Now let 

^,(x) = |{^GH:iV,(x)=j}| (iG[iV]). 

Clearly X^^i Aij(x) = m for all x G Let 

M = {(^i(x),...,^iv(x)) :xG^>}. 

Let L = |M|. Clearly, |M| < m^, so L has bit-size 0{ra). Now, for m G M, let 

K{rn) = |{x G $ : /ij(x) = m^, j G [N]}\ < < . 

Thus, |^>| = ^^gM^M- Now let J(m) = njii < Thus, the J(m), K{m) (m G [M]) 
are numbers with 0{m) bits. Then we have 

N 

1^1 = E n = E ^mIIj'"^ = E ^(m)j(m). 

xG* iG[m] meM j=l meM 

Now, for s G [L], consider the F- formulae 

$:(x,zi,...,z,) = /\ $*(x,Zi), 

where Zj (z G [s]) are distinct variables. Then <I>* is an instance of #CSP(F), with at most kms 
constraints and krms variables, and we clearly have 

\^:\ = K{ui)j{uir. 

meM 

Note that $* is of size polynomial in m. Therefore we can evaluate j<I>*| for all s G [L] using a 
polynomial number of calls to an oracle for #CSP(F), each having input of size polynomial in 
m. It then follows, using [Ml Lemma 3.2], that we can recover X^meM -^("^) ~ 1^1 from the 
values of the |$*[ {s G [L]) in time polynomial in L, which is polynomial in m. □ 

Corollary 23. If H e (F) and #CSP{H) is ffP- complete, then #CSP(F) is H^P -complete. 

First, we apply Corollary 1231 to give a short proof of the main result of [6j. (Bulatov and 
Dalmau phrase the result in terms of the existence of a Mal'tsev polymorphism but, by LemmaO 
our phrasing is equivalent.) 
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Lemma 24 (Bulatov and Dalmau If the constraint language T is not strongly rectangular, 
then #CSP(r) is -complete. 

Proof. Clearly ^CSP{T) G ^^P for any T. If T is not strongly rectangular, there is an r-ary 
relation B G (F) that is not rectangular when considered as a binary relation over x D^'^^ for 
some k with 1 < k < r. Let G = {V, E) be a connected, undirected bipartite graph with vertex 
bipartition Vi, V2. Let <I>i be the L-formula with a constraint B(xi,Xj) for each {I'i, i^j} € E with 
Vi £ Vi, Vj G V2. Define $2 analogously, but with constraints B{xj,Xi). It follows that |<I>i| + |<I>2| 
is the number of graph homomorphisms from G to Gb- This problem is #P- complete by |16j . 
since Qb has a component which is not a bipartite clique. Thus, #CSP(i?) is #P-complete and, 
hence, #CSP(r) is #P-complete by Corollary [23l □ 

There is an important generalisation of the counting problem to weighted problems which we 
now describe briefly; see [8lll4] for details. The relations H C W in T are replaced by functions 
/: Q^, where Q"*" denotes the non- negative rationalsH Thus, F is replaced by a set of 

functions J-". We will call {D,J^) a weighted structure. The underlying relation of f £ T is 
{u € : /(u) > 0}. Note that a relation H can be identified with a function fu ■ — )• {0, 1}, 
where /h(u) = 1 if, and only if, u £ H. Then H is the underlying relation of fn- Thus, we may 
just use H to denote the function fn without further comment. 

Now, using notation similar to the relational case, an instance I of #CSP(J-') is defined as 
follows. A constraint Q has the form f{xi^,Xi2, . . . ,Xi^) for some r-ary function f £ T. Thus, 
(fj^, fj2) • • • > ^^v) is the scope of the G. Suppose we have constraints Gi,...,6m, where 6^ 
applies the function fg S T . Write for (xj^, Xjj, . . . , Xj^), where (fj^, fjj) • • • > ^^v) is the scope 
of the 0^. Then, the weight of an assignment x: V D is 

m 

w(x)=n/,(x,). 

s=l 

The computational problem ^(1SP{J-) is then to compute the partition function, 

Z{Z)= W(x). 

x: V^D 

If T = {/} for a single function /, we write #CSP(/). 

We may view a binary function f : Ai x A2 ^ as a matrix with elements in Q"*", rows 
indexed by Ai and columns indexed by A2. If B is its underlying relation, the submatrix of / 
induced by a block of B is called a block of /. If /i, /2, . . . , are the blocks of /, then / will be 
called a rank-one block matrix, if each block of / is a rank one matrix. 

Lemma 25. If f : Ai x A2 is a rank-one block matrix, its underlying relation B is 

rectangular. 

Proof. If B is not rectangular, there are (a,c), (6, c), (a, d) G R such that (6, d) ^ B. The 2x2 
sub-matrix of / induced by rows a, b and columns c, d is included within a single block and has 
determinant —f{a,d)f{b,c) ^ and so has rank 2. Therefore, the block of / that contains this 
sub-matrix has rank at least 2. □ 

"^More generally, we can take the function values to be non-negative algebraic numbers. 
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We will call a matrix / : A1XA2 — ?> rectangular if its underlying relation R is rectangular. 
Thus, an alternative way of defining a rank-one block matrix is as a rectangular matrix /, together 
with functions qi : Ai ^ Q+ 02- A2 ^ Q^, such that f{x, y) = ai(x)a2(y) for all (x, y) € B. 

We can now state a theorem of Bulatov and Grohe [8, Theorem 14], which generalises the 
result of Dyer and Greenhill [16] to the weighted case. Although we give the theorem for non- 
negative rational functions, in fact we only require the case for non-negative integer functions. 

Theorem 26 (Bulatov and Grohe [8|). Let f ■ Ai x A2 — s- be a binary function. Then 
7^CSP(/) is in FP if f is a rank-one block matrix. Otherwise #CSP(/) is ^P-hard. 

In Section [7.11 we will use the following property of rank-one block matrices. 

Lemma 27. If f : Ai x A2 is a rank-one block matrix, it is uniquely determined by its 

underlying relation and its row and column totals. 

Proof. Let B be the underlying (rectangular) relation. Consider any block C of B, with pr^C = 
Si, pr2C = 5*2. Then there exist ai : 5i — )• and Q2 : 5*2 such that /(xi,X2) = 

ai{xi)a2{x2) for every xi G Si and X2 G 5*2. Now, let 



f{xi,-) 




= 01(2:1) E "2(3:2) 






X2&S2 


f{;X2) 




= 0L2{X2) E 








/(v) 




= E (^l{xi) E "2(2:2) 




xiG5i 


xisSi X2&S2 



be the row, column and grand totals of f{xi,X2) (xi G S'i,X2 G ^2). A simple calculation gives 

^ /(xi, ■)/(■, X2) 
/(X1,X2) = j^— ^ . □ 

7 The dichotomy theorem 

We are now ready to describe the dichotomy. We saw in the previous section that, assuming FP 7^ 
#P, strong rectangularity is a necessary condition for tractability. In this section, we introduce 
a stronger condition, based on certain rank-one block matrices and show that it characterises 
the dichotomy for #CSP, into problems in FP and problems which are #P-complete. As one 
would expect, this condition turns out to be equivalent to the criterion in Bulatov's dichotomy 
theorem. We defer the algorithm for the polynomial-time cases to Section lY.ll and some technical 
results to Section 17.21 In Section [HI we will show that the condition is decidable. 

Let H{x, y, z) be a ternary relation on Ai x A2 x A^. We will call H balanced if the balance 
matrix, 

M(x, y) = |{z G ^3 : {x, y,z)£H}\ (x G ^1, y G A2) 

is a rank-one block matrix. A relation of arity n > 3 is balanced if every expression of it as a 
ternary relation on x x (d, £ > 1, /c -|- £ < n) is balanced. We wih say that T is 

strongly balanced if every pp-definable ternary relation is balanced. 
We will prove the following dichotomy theorem. 
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Theorem 28. // T is strongly balanced, #CSP(r) is in FP. Otherwise, #CSP(r) is #P- 
complete. Moreover, the dichotomy is decidable. 

Proof. The first statement will be proved in Section 17. 1[ The second is proved in Lemma [3T] 
below. The third is proved in Section [HI □ 

We first show that the condition of strong balance is strictly stronger than that of strong 
rectangularity. 

Lemma 29. Strong balance implies strong rectangularity. 

Proof. This follows from the definition of strong balance. Suppose T is strongly balanced and 
let B{x,y) be any definable binary relation. Let 

H{x, y, z) = 3w B{x, y) A B{z, w) , 

which must be balanced. Then M{x,y) = \{z : 3w B{z,w)}\ = \priB\, for all {x,y) € B. If 
|pr]^5| = then B = which is trivially rectangular. Otherwise, the underlying relation of M 
is -B, which must be rectangular by Lemma [25l □ 

The converse of Lemma [29] is not true, however. 

Lemma 30. Strong rectangularity does not imply strong balance. 

Proof. Consider the following example. Let A = {ao,0) ao.ii oi.Oi &} and let D = AL\ {0, 1}. 
Let r = {i?}, where R is the ternary relation given by 

R={{i,j,ai^j) : i,j G {0, 1}} U {(0, 0, 6)} . 

Note that b is, in effect, a second copy of ao,o; the effect is essentially that of a weighted relation 
where the tuple (0,0, ao,o) has weight 2 and all other tuples have unit weight. The balance 
matrix M for R is as follows (we omit the rows and columns for x ^ A as they have only zeroes) : 



M 



1 
2 1 

1 1 



M is clearly not a rank-1 block matrix, so R is not strongly balanced. Nonetheless, we will 
show that R has a Mal'tsev polymorphism. Consider the following function, where © denotes 
addition modulo 2. 



f{x,y,z) 



x®y®z 

0'f(i,k,m),f(j,e,n) 



if x,y,z e {0, 1} 



if X = a 



ao,o otherwise. 
Let g{b) = ao,o and g{x) = x for all other x (z D. We define the function ip as follows: 



(pix,y,z) 



X if y = z 

z if X = y 

, f{9{x),g{y),g{z)) otherwise. 
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In other words, ip behaves identically to /, except that it has the Mal'tsev property and, for 
inputs where x ^ y and y 7^ z, it "pretends" that any input of b is actually an input of ao,o- 
Note that, for i, k S {0, 1}, ip{i,j, k) = i ® j ® k, regardless of the Mal'tsev condition. 

We claim that, as well as being Mal'tsev, (/? is a polymorphism of R. To this end, let 
X, y, z € ii, which we can write as x = (i,j,x'), y = {k,i,y') and z = {m,n,z'), where x' = aij 
or, if i = J = 0, we may have x' = b, and similarly for y' and z'. So, we have 

</'(x,y,z) = {ip{i,k,m),ip{j,e,n),ip{x',y',z')) 

= {f{h k, m), f{j, e, n), f{g{x'),g{y'),g{z')) 
= {fih k, m), fij, e, n),af(^i^k^jn)j{j,e,n)) 
eR. 

This establishes the claim. □ 

Remark 7. The example in Lemma[30lcan be extended to relations of arbitrary size by extending 
i and j in the tuples aij) to longer binary strings and interpreting © as bit-wise XOR (e.g., 
0011 ©0101 = 0110). 

Remark 8. Bulatov and Dalmau conjectured in [5] that a Mal'tsev polymorphism was sufficient 
for ^CSP(r) to be in FP. That is a stronger claim than the converse of Lemma [29l The 
conjecture was withdrawn in [6j, with a counterexample somewhat similar to that in the proof 
of Lemma [30l 

Next, we strengthen Lemma [2^ to prove one half of the dichotomy. 

Lemma 31. IfTis not strongly balanced, then 7^CSP(r) is ^P-complete. 

Proof. If r is not strongly balanced, there is an unbalanced ternary relation H G (L). Let E be 
a binary relation with y = Vi U F2, ^1 n V'2 = and pr^E = Vi {i = 1, 2). Let <I) be the L-formula 
with a constraint H{xi,Xj, Zij) for each [ui^Uj) G E. Thus, $ has \V\ + \E\ variables and \E\ 
constraints. Let M: Vi x V2 ^ be $'s balance matrix. 

We have |$| = Z{X), where Z{X) is the partition function for an instance X of ij^Q.SP{M) with 
input E. But this problem is #P-hard by Theorem 1261 and, hence, ^Q.SP{H) is #P-complete. 
Thus, #CSP(r) is #P-complete by Corollary [23 □ 

In [5, Bulatov defined congruence singularity. Suppose L is a constraint language and pi 
and p2 are two congruences defined on the same pp-definable set A C D^. Let the equivalence 
classes of pi be Eij (j € [z/j], i = 1, 2). Further, let 

M{3, k) = \Eij n E2k\ (j e [z^i], k e H). (i) 

r is congruence singular if is a rank-one block matrix for every pair pi, p2 of congruences]! 

Lemma 32. T is congruence singular if, and only if, it is strongly balanced. 

Proof. Suppose T is strongly balanced, let ^4 C be defined by the formula x and let pi, P2 G 
(r) be congruences defined on A C with equivalence classes Eij (j S [fj], i = 1,2). Then 
V'(x, y, z) = x(z) A /Oi(x, z) A /02(z, y) is a ternary relation. Hence, for any x S Eij and y G E2k, 
the matrix 

M(x,y) = |{z : x(z) A pi(x, z) A ^2(2, y)}| = \Eij n E2k\ 
^In fact, Bulatov applies this term to the associated algebra, but with essentially this meaning. 



20 



is a rank-one block matrix. But M has a set of identical rows for all x E Eij (j G [z^i]) and a 
set of identical columns for all y € E2k {k € [^^2])- The matrix M has one representative from 
each of these sets. It follows that Ai is a rank-one block matrix. 

Now, suppose that T is congruence singular and let H G (F) be any ternary relation. Define 
relations pi = {(x, y) : x, y € and Xi = yi} {i = 1,2). These are trivially equivalence relations, 
and are pp-definable as X2, X3) A H{yi,y2,y3) A (xj = yi). Thus, they are two congruences 

defined on the same set, H, which is also pp-definable. The equivalence classes of pi clearly 
correspond to Zj G p<^iH {i = 1,2) and we may index these classes by Zj. Thus, 

M{zi,Z2) = |{(xi,X2,X3) e H : Xi = Zi,X2 = Z2}\ 

= |{X3 : {zi,Z2,X3) G H}\ 

= M{zi,Z2). 

Since is a rank-one block matrix by assumption, so is M, and the conclusion follows. □ 

In [3], Bulatov established the following theorem, giving a dichotomy for #CSP that is 
equivalent, using Lemma [321 to Theorem [251 except that the decidability of the dichotomy 
remained open. 

Theorem 33 (Bulatov [3]). IfT is congruence singular, ^CSP{T) is in FP. Otherwise #CSP(r) 
is -complete. 

7.1 The counting algorithm 

This section is devoted to a proof of the polynomial-time case of the dichotomy theorem. 

Lemma 34. Let T be strongly balanced and let R G (F) be an n-ary relation. Given a frame F 
for R, \R\ can be computed in O(n^) time. 

Proof. If n = 1 then R = pr^^i? = pr^i^ = F so \R\ = \F\ and we are done. So we may assume 
that n > 2. Now, for 1 < i < j < n, define Nij : pr^R ^ N by 

Ni,j{a) = |{(u,a) G pr[i]u{j}-R}| . 

Since we have 

\R\ = Yj Nn-l,n{a) , 

we need to compute the function Nn-i^n, which we do iteratively. For each j G {2, ...,n}, 
-^i,j(a) = \{b G prii? : {b,a) G pri jR}\. By Lemma \T3\ these quantities can be computed by 
using F to determine pfij-R, in total time O(n^). (Note, in particular, that |pr^ji?| < = 0{1) 
and F may be assumed to be small so \F\ < 0{n).) To continue the iteration, we use Ni^i^i and 
Nij to compute Ni+ij for j = i+2, . . . , n. We repeat these computations for each i = 1, . . . , n—1. 

Consider a particular i and j and suppose that we have computed A^i-i,fc for all k > i. Let 
J = [z] U {j} and let H = prjR, which we will express as a ternary relation 

H = {{u,x,y) G pr jR : u G pr[i_i]R, x G pr^R, y G pr^R} . 

Since R is strongly balanced, the matrix 

M{x,y) = \{u G pr[i_i]i? : (u,x,y) G H}\ 
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is a rank-one block matrix. The block structure of M is given by the relation prjji?, since if 
G pr^ jR, there is at least one t € i? with pr^t = x and pr^t = y. By Lemma \T3\ we can 
compute p^i jR in 0{n) time, using F. 

For notational simplicity, let us write Vi = pr^R. Consider M{-,y), the y-indexed row of M. 
We have 

Y,M{x,y)= \{u:{u,x,y)eH}\ = \{{u,x):{u,x,y)eH}\=Ni,,{y). (2) 

Now observe that the relation By{u,x) = {{u,x) : {u,x,y) € H} is rectangular, by Lemma [9j 
Write Sy{x) = {u : (u, G -fT}. By Corollary [21 there is an equivalence relation on Vj 

0y{xi,X2) = 3u {H{\i, xi,y) A H{u, X2, y)) 

such that Sy{xi) and Sy{x2) are equal, if 9y{xi,X2), and disjoint, otherwise. Thus, if S{y) C Vi 
contains one representative of each equivalence class of 6y, then 

^ M(x,y) = |{u:3x(u,x,y)GF}|=iV,_i,,(y). (3) 

Now, suppose that 6y{xi,X2) and y' 7^ y. Thus, H{u,xi,y) and H(\i,X2,y) for some u, so 
(x2,y) G C for some block C of pr^jR- There is u' such that H{u', xi,y') if, and only if, 
{xi,y') G C. But then we have 



u' 


Xi 


y' 


u 


Xi 


y 


u 


X2 


y 


u' 


X2 


y' 



and, hence, 9yi{xi,X2). Thus, the equivalence relations By depend only on the block C containing 
y. Thus, we may deduce the classes of 9y from P^^ijR and those of the relation defined by 

xi '-^ij X2 3u,y {H(u,xi,y) A H{u,X2,y)) . 

We prove in Section [7.21 below, that the are congruences in (R). Thus, the matrix M has 
identical columns corresponding to the equivalence classes of 

Similarly, there are identical rows corresponding to the equivalence classes of where 

2/1 2/2 3u,x {H{u,x,yi) A H{u,x,y2)) . 

(There is no ambiguity of notation between ^^ij and since we have i < j.) 

We prove in Section [7.21 that the are also congruences in (R). Now, if S'{x) contains 
one representative of each of the classes of the corresponding equivalence relation 6'^, we have 

J2 M{x,y) = \{u:3y{u,x,y)£H}\ = Ni_i4x). (4) 

yeS'ix) 

The matrix M, obtained by choosing one representative from each of the equivalence classes 
of ~jj and is also a rank-one block matrix. Moreover, we know the block structure, row and 
column sums of M, from pv^ jR, ^ij, ® and Hence, by Lemma [271 we can reconstruct 
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all the entries of M . Then, using pr^ jR, and we can reconstruct the matrix M. Finally 
we compute the row sums, as in ([2]), to give the value of Nij{a) for each a € prjR. 

The time complexity of the algorithm is 0{n) for a given i and j, even in the bit-complexity 
model. Since there are 0{n'^) pairs i,j, the overall complexity is 0{n^). 

To complete the proof, we must show how to compute the congruences and ^j^i in 0{n^) 
time. We do this in the following section. □ 

The time complexity of this algorithm is 0{n^). However, observe that the time needed 
to compute F is already 0{mn^). We may assume that m = Q,{n) as, otherwise, there is a 
variable, xi say, which appears in no constraint. Thus, xi can be removed to give a relation 
Ri{x2, ■ ■ ■ ,Xn) such that \R\ = q\Ri\. Therefore, the time complexity of the counting algorithm 
is no worse than the 0{mn'^) cost of computing the frame F. 

7.2 The congruences ~j j and ~j j 

We now prove that the relations ~j j- and used in the proof of Lemma [M] are congruences 
and that they can be computed efficiently. Let F be strongly rectangular and let R be an n-ary 
relation determined by a F-formula For 1 < « < j < n, recall that 

(i) a b {a,b £ prjR) if there are t,t' £ R such that prjjjt = pr^jjt', tj = a and t'j = b; 

(ii) a ^j^i b {a,b £ pr^R) if there are t,t' £ R such that prjt = prjt', ti = a and t'- = b, 
where J = [i — 1] U {j}. 

Lemma 35. For all 1 < i < j < n, ~jj and ^j^i are congruences in {R). 

Proof. Consider the binary relation B defined by -B(u, y) = 3zi, Z2 -R(u, zi, y, Z2) on pr^jjiix pr^i?. 
This is rectangular so induces a congruence 62 on pfjR by Corollary [3j This congruence is 

The proof for is similar, using B defined by i?(u, y) = 3zi, Z2 i?(x, y, zi, Z2) on ptjR x 
prji?, where u = (x, tu). □ 

Lemma 36. The set of congruences and ^j^i (1 < i < j < n) can be computed in 0{n^) 
time. 

Proof. We compute the relations with i < j, as follows. From the frame F, we compute 
pri jR. For each b £ pr^R, this gives a tuple t such that pr^t = b. We now use Corollary \T7\ to 
compute a frame F* for . . . , tj, Xj+i, . . . , x„) in 0{n^) time. Now prjF* gives the equiva- 
lence class of containing b. We repeat this procedure, as in the proof of Lemma [T9l until 
we have determined all the equivalence classes. 

There are 0{n'^) pairs i, j with i < j and computing each requires 0{n^) time. Thus, 
the we can compute all ~jj in 0{n^) time. 

Now consider the relations with i < j. For each a S pr^R, compute a frame -Fj^a for the 
relation Rj^a determined by $ A Xaixj). (Recall that Xa is the relation containing only a and 
we may assume that Xa S F by Lemma [9l) From Lemma [T9l we can do this in 0{n'^) time, so 
0{n^) time in total. Now, for each i < j, determine pr^ji?, using F. This requires 0{n) time 
for each pair i,j, so 0{n^) time in total. 

Now, for each block C of prj ji?, choose a £ pfjR so that (x, a) £ C for some x £ pr^R. Then 
the congruence ~j of Rj^a gives the equivalence classes of corresponding to C. These can be 
determined in 0{n) time using Fi^a- Thus, the total time to compute ^j^i for all pairs i,j with 
i < j is 0{n^). 

Hence the total time needed to compute all of these congruences is 0{n^). □ 
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8 Decidability 



Having shown that #CSP has a dichotomy, we must consider whether it is effective. That is, 
given a relational structure S = (D, T) can we decide algorithmically whether the problem 
#CSP(r) is in FP or is ^^P-complete? This is the major question left open in [3]. Here we show 
that the answer is in the affirmative. 

We will construct an algorithm to solve the following decision problem. 

Strong Balance 

Instance : A relational structure 6 = {D,T). 
Question : Is T strongly balanced? 

Recall from Section [2] that we may assume that ||r|| > q. Thus, we may take ||r|| as the 
measure of input size for Strong Balance. We bound the complexity of Strong Balance 
as a function of ||r||. Complexity is a secondary issue, since ||r|| is a constant in the nonuniform 
model for #CSP(r). In the nonuniform model, we are only required to show that some algorithm 
exists to solve Strong Balance. However, we believe that the computational complexity of 
deciding the dichotomy is intrinsically interesting. 

Our approach will be to show that the strong balance condition is equivalent to a structural 
property of F that can be checked in NP. 

We must first verify that F is strongly rectangular, since otherwise it cannot be strongly 
balanced, by Lemma [29l Thus, we consider the following computational problem. 

Strong Rectangularity 

Instance : A relational structure & = {D,r). 
Question : Is F strongly rectangular? 

Lemma 37. Strong Rectangularity is in NP. 

Proof. By Lemma [HI we can verify that a given function is a Mal'tsev polymorphism in 
OdlFll^) time. Thus, we select a function ip: — )• D nondeterministically in 0{q'^) = 0{\\T\\^) 
time and check that it is a Mal'tsev polymorphism in a further 0(||Ff ) time. □ 

The remainder of this section is organised as follows. We first give definitions and notation 
that were held over from Section [2] because they are only used here. In Section 18.21 we give 
a characterisation of rank-one block matrices that we use in our decidability proof. The proof 
itself appears in Section 18.31 

8.1 Definitions and notation 

An equivalent but different view of CSP(F) from the one we have used is often taken in the 
literature. This is to regard <I> as a finite structure with domain V and relations determined 
by the scopes of the constraints. Thus, we have relations H, where 12, . . . , v) H if 
,Xi,2, ■ ■ ■ is a constraint. In this view, a satisfying assignment x is a homomorphism 
from $ to F. 

The following definitions and notation will be used in the remainder of this section. Let 
[Di — >■ D2] denote the set of functions from Di to D2- Then a homomorphism between two 
relational structures ©i = (-Di,Fi), 62 = (-02, F2) is a function a G [Di — )• D2] that preserves 
relations. Thus, for each r-ary relation Hi E Fi there is a corresponding r-ary relation H2 € F2 
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and, for each tuple u = (ui, . . . ,Ur) £ Hi, we have cr(u) = (a{ui), . . . ,a{ur)) G H2. We will 
write (t: ©1 — > S2 to indicate that ex is a homomorphism. 

Let [V ^ D] denote the set of all injective functions V ^ D and let \V o D] denote 
the set of all bijective functions V ^ D. If o": Si — >■ S2 and a £ [Di ^ D2], then a is 
called a monomorphism and we will write a: &i ^ &2- If cr is a bijective homomorphism 
and a^^ is also a homomorphism, then a is called an isomorphism and we write cr: ©1 O ©2- 
Then ©1, ©2 are isomorphic, so isomorphic structures are the same up to relabelling. An 
endomorphism of a relational structure © is a homomorphism o" : © ^ © and an automorphism 
is an isomorphism o": © -f-)- ©. Note that the definition of an endomorphism is identical to 
that of a unary polymorphism. Note also that [D ^ D] = [D ■H' D], since D is finite, so an 
injective endomorphism is always an automorphism. Clearly, the identity function is always an 
automorphism, for any relational structure ©. 

We use the following construction of powers of & (see, for example, \25l p. 282]). For any 
relational structure © = (^,r) and /c € N, the relational structure ©'^ = {D^,T^) is defined 
as follows. The domain is the Cartesian power D^. The constraint language is such that, 
for each r-ary relation H £ T, there is an r-ary € T^, which is defined to be the following 
relation. If Uj = Ui,2i • • • 1 G {i £ H)) then (ui, U2, . . . , u^) € if, and only 

if, (uij, U2,j, . . . , Mrj) £ H for all j G [k]. Now, if ^ is a pp-formula in T, we define the 
corresponding formula to be identical to ^ , except that each occurrence of G T is replaced 
by the corresponding relation G V^. Observe that the relation ^'^ is actually pp-definable in 
r, by the formula ^'^(x) = ^'(xi) A ^(x2) A • • • A ^'(xfc), where Xj [i G [k\) are disjoint n-tuples 
of variables. In particular, we have |^''^| = |^[^ 

Using this construction, the definition of a polymorphism can be reformulated. In this view 
of CSP(r), it follows directly that a fc-ary polymorphism is just a homomorphism ip: ©'' — )• ©. 



8.2 Rank-one block matrices 

In our decidability proof, we use a different characterisation of rank-one block matrices, given 
by Corollary HOl This may seem more complicated than the original definition but it is more 
suited to our purpose. 

Lemma 38. A matrix A is a rank-one block matrix if, and only if, every 2x2 submatrix of A 
is a rank- one block matrix. 

Proof. Let Ahe a, k x i rank-one block matrix and let 



B 



{i,j G [k], i^ j;r,s £ [I], r^s) 



be any 2x2 submatrix of A. If any of air, Ois, Ojr, Ojs is zero, at least two must be zero, since 
A is rectangular. Then B is clearly a rank-one block matrix. If Oir, ais, ajr, ajg are all nonzero, 
B must be a submatrix of some block of A. Since this block has rank one, B also has rank one. 

Conversely, suppose A is not a rank-one block matrix. If its underlying relation is not 
rectangular, there exist air,ais,ajr > with ajg = 0. The corresponding matrix B clearly has 
rank 2, but has only one block so is not a rank-one block matrix. If the underlying relation of 
A is rectangular, then A must have a block of rank at least 2. This block must have some 2x2 
submatrix B with rank 2 and all its elements air,ais,ajr,ajs > 0. □ 
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Lemma 39. Let A he a rectangular 2x2 matrix. A is a rank- one block matrix if, and only if, 

2 2 2 2 

aiia220'120-2l — ai2'^2l'^ll'^22- 

Proof. This equation holds if any of an, 022, ^12 or 021 is zero. But then rectangularity impUes 
that at least two of them must be zero and ^ is a rank-one block matrix in all possible cases. 
Otherwise, the equation is equivalent to 011022 = 012O21, which is the condition that A is 
singular. So A is one block, with rank one. The argument is clearly reversible. □ 

Corollary 40. Ket A he a rectangular k x i matrix. A is a rank-one hlock matrix if, and only 
if, af^ajgUisajr = af^a^.^airajs for all i,j G [k] and all r,s £ [£]. 

Proof. When i = j or r = s, the two sides of this equation are identical. Otherwise, the equality 
follows directly from Lemmas [38l and [39l □ 

Remark 9. It is possible to modify the above so that Corollarv 1401 involves products of only five 
elements, rather than six, but we do not pursue this refinement. 

8.3 Decidability 

To show the decidability of strong balance, we relax the criterion of strong balance, by noting the 
conditions sufficient for the success of the algorithm in Section [7.11 Observe that only ternary 
relations on D x D x D\ for i G [n — 2], are required to be balanced. Therefore, let ^'(x), with 
X = (xi, . . . , be an arbitrary formula pp-definable in F, which we consider fixed for the rest 
of this section. Then, for the algorithm to succeed, it suffices that the qx q matrix 

M(a, h) = |{x G [y ^ D] : X G xi = o, X2 = 6}| (Va, b D) 

is always a rank-one block matrix. Note that we can always assume that the underlying relation 
of M is rectangular, since T is known to be strongly rectangular. 

Remark 10. Call this condition almost- strong balance. It is equivalent to strong balance if 
FP 7^ #P. If S is strongly balanced, it is clearly almost-strongly balanced. Almost-strong 
balance implies that the algorithm of Section [7.11 succeeds, which implies that 7^CSP(r) G FP. 
Thus 77^CSP(r) is not T^^P-complete, which implies that it is strongly balanced by Lemma [3T1 
This chain of implications requires FP 7^ #P, so we make that assumption in the remainder of 
this section. If FP = ^^P, no dichotomy exists and the property of strong balance ceases to be 
of computational interest. 

We may therefore take almost-strong balance as the criterion for strong balance. By Corol- 
lary [JOl the condition for M to be a rank-one block matrix is that 

M{a,cfM{a,d)M{h,dfM{b,c) = M{a,df M{a,c)M{b,cf M{h,d), (5) 

for all a, b,c,d G D. 

We can reformulate the condition for strong balance using the construction of powers of (3. 
If a = (ai, . . . , fl/c) and b = (61, . . . , 6/;), the balance matrix for ^'^ is the x q^ matrix 

Mfe(a,b) = |{x G[V ^ D''] : X G xi = a,X2 = b}| 
= M{ai,bi)M{a2,b2) ■ ■ ■ M{ak,bk) . 
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Using this, equation ([5]) can be rewritten as 



Me{a,c) = Me{a,d), (6) 

where 

a = {a,a,a,b,b,b), c = {c,c,d,d,d,c), d = {d,d,c,c,c,d) . (7) 

Fix a, c, d and, for notational simphcity, write & for S^, F for F^, ^ for M for Mg and -D 
for D^. Then, from we must verify that M(a, c) = M{d,d) for all relations 'J' which are 
pp-definable in F and given a,c,d G D. We use a method of Lovasz [23j; see also [15]. For 
s e D, let 

Homa_5(^) = {x G [y D] : X G j;i = a, j;2 = s} 
homa,5(^) = |Homij,,-(^)|. 

However, a homomorphism V D that is consistent with ^ is just a satisfying assignment 
to ^. M{a,s) is the number of such assignments with xi = d and X2 = s, i.e., the number of 
homomorphisms that map xi a and X2 s. This proves the following. 

Lemma 41. F is strongly balanced if, and only if, homa^c(^) = hom^ for all formulae ^ 
and all d, c, d of the form above. 

We will also need to consider the injective functions in Homa^g(^). For s D, let 

MonQ,g(^) = {x e [V ^ D] : X e ^ , xi = d, X2 = s} 
mona,s{^) = \Mona,s{^)\ . 

Lemma 42. homa^c(^) = hom^ j(^) for 0,11 ^ if, and only if, mona,c(^) = mon^ di^) for oil ^ . 

Proof. Consider the set Z of all partitions / of ^ into disjoint classes Ji, . . . , 7^^-, such that 1 G /i, 
2 G I2. Writing I ^ I' whenever / is a refinement of P = (X, ^) is a poset. We will write _L 
for the partition into singletons, so ± ^ / for all I G I. 

Let V/I denote the set of classes h, ■ ■ ■ ,Ikj of the partition /, so \V/I\ = ki, and let 7i, I2 
be denoted by 1//, 2/1. Let ^ /I denote the relation obtained from ^ by imposing equality on 
all pairs of variables that occur in the same partition of I. Thus, the constraints xi = d, X2 = s 
become Xijj = d, X2/1 = s. Then we have 

hom;,,s(^') = hom^,5(^/±) =^mon^,5(^'//) =^mon-a4^/I)C{±,I) , (8) 

lex lex 

where (^{1,1') = 1, if / ^ /', and = 0, otherwise, is the C-function of the poset P. Thus, 

if mona,c(^) = mon^ di^) for all ^, then 

hom,,e-(^') = J]mon,,g(f'//)C(±,/) = mon,,,-(^//)C(±, /) = hom.^jW . (9) 
lex lex 

More generally, the reasoning used to give ([8]) implies that 

hom^,,(^'/I) = ^ mon^,5(^//') = mon-a^^ /l')C{I, I') . 
i^r I'ex 
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Now, Mobius inversion for posets \30\ Ch. 25] implies that the matrix C,: X xX ^ {0, 1} has an 
inverse /Li: X x I — )■ Z. It follows directly that 

mona,5(^') = ^ homa,^-(^'//)/i(_L, /) . 

lex 

Thus, if homa_c(^) = hom^ for all then 

mon,,e(^) =^hom,,,-(^//)^(±,/) = ^ hom^^j(^'//)^(±, /) = mon.^jC^-) . (10) 
lex lex 

Now, ([9]) and (fTO|) give the conclusion. □ 

Lemma 43. mona,c(^) = ™on^j('I'), /or a// if, and only if, there is an automorphism 
rj:D-^Dof& = {D,T) such that r]{a) = a and rj{c) = d. 

Proof. The condition holds if 6 has such an automorphism since, if '^(x) = 3y ^(x, y) for some 
l>, then 

mons,g(f') = |{x € [y 5] : xi = a, X2 = c, 3y (x, y) S $}| 

= |{??(x) e[V D]:xi= r]{a), X2 = vie), 3y (7?(x), r/(y)) G ^ 
= |{x e[V^ D]:xi=a,X2 = d, 3y (x,y) € ^}\ 
= monj;j(^'). 

For the converse, suppose we have mona^c(^) = ™oiia j(^) ^- Consider the following 

F-formula ^ with domain D and variables Xj (i (z D), 

^(x) = /\ /\ ^(xsi,...,xsj. 

/fef [ui,...,ur)eH 

Then 

Mons,i(l') = {x G [Z) Z)] : x^ = a, Xg = s, x € ^} . 

We have Mona^c($) 7^ 0, since the identity assignment Xi = i {i & D) is clearly satisfying. Thus, 
by the assumption, Mon^j($) 7^ 0. Let r] € Mon^ so r] is an endomorphism of (3 with 

r/(a) = a, r/(c) = d. Since [D D] = [D -h^ D], r]: D ^ D \s the required automorphism. □ 

Corollary 44. S = (L), T) is strongly balanced if, and only if, for all a,b,c,d & D and a, c, d as 
defined in ([7]), 6 = {D,T) has an automorphism rj such that 77(a) = d and r]{c) = d. 

Proof. This follows from ([6]) and Lemmas HTl H2] and |43j □ 

This characterisation of strong balance leads to a nondeterministic algorithm. 
Theorem 45. Strong Balance is in NP. 

Proof. We first determine whether T is strongly rectangular, using the method of Lemma [371 If 
it is not, then T is not strongly rectangular by Lemma [29l 

Otherwise, we can construct (3 = (-D,f) in time OdlFH^). Let q = = \D\ and let 11 
denote the set of q\ permutations of D. Each tt G 11 is a function ir: D ^ D and so a potential 
automorphism of &. For each of the q^ possible choices a, b,c,d £ D, we determine a, c, J € D 
in polynomial time. We select vr G 11 nondeterministically and check that 7r(a) = a, 7r(c) = d 
and that vr preserves all -ff € f . The computation requires 0(g^||f p) = 0(||F||"^^) time in 
total, so everything other than the 0{q^^) = OdlFH^*^) nondeterministic choices can be done 
deterministically in a polynomial number of steps. □ 
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Remark 1 1 . We have paid little attention to the efficiency of the computations in Theorem USj 
If the elements of D are encoded as binary numbers in [q], comparisons and nondeterministic 
choices require 0{logq) bit operations, rather than the 0(1) operations in our accounting. On 
the other hand, membership in can be tested in Odli^H) comparisons, rather than the 
0(||iJ||^) that we have allowed. This might be reduced further by storing H in a suitable data 
structure, instead of a simple matrix. We could also use Remark [9] to improve the algorithm of 
Theorem USj 

Remark 12. Theorem 1451 and Lemma [32] together imply that the following problem, posed by 
Bulatov [3], can also be decided in NP. 

Congruence Singularity 

Instance : A relational structure S = {D,T). 
Question : Is T congruence singular? 

Whether this can be shown directly, and not via Strong Balance, remains open. 

9 Conclusions 

We have shown that there is an effective dichotomy for the whole of #CSP. We have given a 
new, and simpler, proof for the existence of the dichotomy and the first proof of its decidability. 

The complexity of our counting algorithm is 0{n^), whereas algorithms for most known 
counting dichotomies are of lower complexity, often 0{n). Can the complexity of the general 
algorithm be improved to O(n^), or better? Since frames, on which the algorithm is based, have 
size 0{n), there is no obvious reason why this cannot be done. 

A second problem that we have not yet considered is an extension to a dichotomy for weighted 
counting problems [SKUj. We believe that this is possible. In fact, a dichotomy for rational 
weights has already been shown in [7]. This gives an indirect argument, using the unweighted 
dichotomy. Decidability of the dichotomy of [7] now follows from Section [8] of this paper. 

A third issue is to investigate whether known counting dichotomies can be recovered from 
these general theorems. We have some preliminary results in this direction. The characterisation 
of Lemma |43] appears to be useful in this respect. 

A fourth problem is to determine the complexity of Strong Balance more precisely, rather 
than just establishing membership in NP. Strong Balance seems unlikely to be NP-complete 
as the automorphism tests required can be coded into a single instance of the graph isomorphism 
problem. However, it is not obvious whether the converse reduction is possible so it may be that 
Strong Balance is in P. 

Finally, a deeper question that arises from our work is to what extent the detailed properties 
of the algebras associated with CSP instances are of real significance. In recent years, the 
algebraic approach has proven successful in the study of CSP, but it is possible that these 
algebras are more complicated objects than the relations they are intended to capture. 

Note. Since this paper was written, Cai, Chen and Lu have extended and strengthened our 
methods to give an effective dichotomy for the weighted counting problem [9]. 
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